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Abstract 

We provide simplified proofs for the asymptotic distribution of tfie number of cuts required 
to cut down a Galton- Watson tree with critical, finite-variance offspring distribution, conditioned 
to have total progeny n. Our proof is based on a coupling which yields a precise, non-asymptotic 
distributional result for the case of uniformly random rooted labeled trees (or, equivalently, Pois- 
son Galton-Watson trees conditioned on their size). Our approach also provides a new, random 
reversible transformation between Brownian excursion and Brownian bridge. 

1 Introduction 

The subject of cutting down trees was introduced by Meir and Moon [34, 35]. One is given a rooted 
tree T which is pruned by random removal of edges. At each step, only the portion containing the 
root is retained (we refer to the portions not containing the root as the pruned portions) and the 
process continues until eventually the root has been isolated. The main parameter of interest is the 
random number of cuts necessary to isolate the root. (The dual problem of isolating a leaf or a node 
with a specific label has been considered by Kuba and Panholzer [28, 29].) 

The procedure has been studied on different deterministic and random trees. Essentially two 
kinds of random models have been considered for the tree: recursive trees with typical inter-node 
distances of order logn [18, 21, 22, 36], and trees arising from critical, finite variance branching 
processes conditioned to have size n, with typical distances of order ^/n [19, 23, 24, 38, 39]. In this 
paper, we are interested in the latter family, and will refer to such trees as conditioned trees for short. 

For conditioned trees emerging from a progeny distribution with variance ct^ e (0,oo), once 
divided by a^/n, the number of cuts required to isolate the root of a conditioned tree of size n 
converges in distribution to a Rayleigh random variable with density xe~^ on [0, oo). (In this 
form, under only a second moment assumption, this was proved by Janson [24]; below we discuss 
earlier, partial results in this direction.) The fact that the Rayleigh distribution appears here with 
a ^fn scaling in a setting involving conditioned trees struck us as deserving of explanation. The 
Rayleigh distribution also arises as the limiting distribution of the length of a path between two 
uniformly random nodes in a conditioned tree, after appropriate rescaling. 

In this paper we show that the existence of a Rayleigh limit in both cases is not fortuitous. We 
will prove using a coupling method that the number of cuts and the distance between two random 
vertices are asymptotically equal in distribution (modulo a constant factor cr^). This approach yields 
as a by-product very simple proofs of the results concerning the distribution of the number of cuts 
obtained in [19, 23, 24, 38]; this is explained in Section 6. 

At the heart of our approach is a coupling which yields the exact distribution of the number 
of cuts for every fixed n, for the special case of uniform Cayley trees (uniformly random labelled 
rooted trees). Given a rooted tree t and a sequence S = (v\^. . . ,Vk) of not necessarily distinct 
nodes of t, consider an edge removal procedure defined as follows. The planting oft at S, denoted 
t{S), is obtained from t as follows. For each 1 < i < k create a new node Wi whose only neighbour 
in t is Vi. Let W = {wi, . . . , Wk} be the set of new vertices (it may be more natural to take Vl^ as a 
sequence, since 5* is a sequence, but taking as a set turns out to be notationally more convenient 
later). For a subgraph t' of t{S) and a vertex v, we write C{v, t') for the connected component of 
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t' containing w; let also C{V,t') be the (minimal) set of connected components containing all the 
vertices in a set V . 

Let = t{S), and for j > 0, let be obtained from F'-J^ by removing a uniformly 

random edge from among all edges of C{W, if there are any such edges. The procedure stops 

at the first time j at which C{W, F^-'^) simply consists of the set of new vertices {wi, . . . , Wk}- We 
call this procedure planted cutting of S in t. We remark that Janson [23] already introduced the 
planted cutting procedure in the case fc = 1. Note that if t is a rooted tree with root r, then t{{r}) 
contains only one node which is not a node of t, and in this case the cutting procedure is almost 
identical to that described in the first paragraph of the introduction (see, however, the remark just 
before Theorem 3.1). Write M = M{t, S) for the (random) total number of edges removed in the 
above procedure. We remark that for each < i < M, F*^*^ has i + I connected components, each 
of which is a tree. 

Theorem 1.1. Fix n > 1 and k > 1, let Tn be a uniform Cayley tree on nodes [n] — {1, . . . , n}, 
let Vi, . . . ,Vk be independent, uniformly random nodes o/T„, and write Sk = [Vi, . . . , Vk). Then 
M{Tn, Sk) — k is distributed as the number of edges spanned by the root plus k independent, 
uniformly random nodes in a uniform Cayley tree of size n. 

For fc > 1, let Xk be a chi random variable with 2fc degrees of freedom; the distribution of Xk is 
given by 

Corollary 1.2. For any fixed k, as n ^ oo, {M(Tn, Sk))/\/n converges to Xk in distribution and 
in U' for any p € [1, oo). 

Remarks 

★ Corollary 1.2 follows immediately from Theorem 1.1 and easy combinatorial considerations (or 
see, e.g., Aldous [6], Lemma 21). 

-k In the special case fc = 1, Theorem 1.1 states that the number of edges required to isolate the 
planted node in a planted uniform Cayley tree of size n, is identical in distribution to the number of 
vertices on the path between two uniformly random nodes in a uniform Cayley tree of size n. For 
the case fc = 1, Chassaing and Marchand [16] have also announced a simple bijective proof of this 
result, based on linear probing hashing. 

★ We note that Bertoin [11] has used powerful recent results of Haas and Miermont [20] to establish 
the distributional convergence in Corollary 1.2. 

★ The original analyses by Meir and Moon [34] include asymptotics for the mean and variance of 
the number of cuts. In recent years, the subject of distributional asymptotics has been revisited by 
several researchers. Panholzer [38] and Fill, Kapur, and Panholzer [19] have studied the somewhat 
simpler case where, the laws of the trees (as n varies), satisfy a certain consistency relation. More 
precisely, if /i„ is the law of the n-vertex tree, the consistency condition requires that after one step 
of the cutting procedure, conditional on the size fc of the pruned fragment, the pruned fragment and 
the remaining tree are independent, with respective laws fik and jin-k- The class of random trees 
which satisfy this property includes uniform Cayley trees. For this class, they obtained the limiting 
distribution of various functionals of the number of cuts using the method of moments, and gave an 
analytic treatment of the recursive equation describing the cutting procedure. Janson [23, 24] used 
a representation of the number of cuts in terms of generalized records in a labelled tree to extend 
some of these results to all the family trees of critical branching processes with offspring distribution 
having a finite variance. His method is also based on the calculation of moments. 

In the case fc = 1, our coupling approach also allows us to describe the joint distribution of the 
sequence of pruned trees. In this paper, a. forest is a sequence of rooted labelled trees f = (ti , . . . , tj) 
with pairwise disjoint sets of labels. In the notation of Theorem 1.1 and of the paragraph which 
precedes it, write M = M{Tn, Si) and write {T^^\ . . . ^T^^*^)) for the connected components of 
listed in the order they are created during the edge removal procedure on T'„(S'i). Note that 
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the edge removal procedure stops at the first time that wi is isolated, so necessarily r(*^) consists 
simply of the single vertex wi. For each 1 < i < M, T^'^'> is a tree, which we view as rooted at 
whichever node of T*^') was closest to wi in T„(5'i); in particular, necessarily is rooted 

atFi. 

Theorem 1.3. The forest (T'^^\ . . . , T*^*^~^)) is distributed as a uniformly random forest on [n]. 

The analysis which leads to Theorem 1.3 will also yield as a byproduct the following result. 

Theorem 1.4. Let = (Ti, . . . , T„) be a uniformly random forest on [n]. For each i G [k — 1], 
add an edge from the root ofFi to a uniformly random node from among all nodes in T^+i, . . . , T^. 
Call the resulting tree F, and view F as rooted at the root q/T^. Then F is distributed as a uniform 
Cayley tree on [n]. 

It turns out that our coupling approach allows us to prove results about a natural "continuum 
version" of the random cutting procedure which takes place on the Brownian continuum random 
tree (CRT). Our main result about randomly cutting the CRT is Theorem 5.1, below. Although 
we work principally in the language of M-trees, Theorem 5.1 can be viewed as a new, invertible 
random transformation between Brownian excursion and reflecting Brownian bridge. Though the 
precise statement requires a fair amount of set-up, if this set-up is taken for granted the result can be 
easily described. (For the reader for whom the following three paragraphs are opaque, all the below 
terminology will be re-introduced and formally defined later in the paper) 

Let [T ; d) be a CRT with root p and mass measure /i, write skel(T) for its skeleton, and let V be 
a homogeneous Poisson point process on skel(T) x [0, oo) with intensity measure £ ® dt, where £ 
is the length measure on the skeleton. We think of the second coordinate as a time parameter View 
each point (p, t) of as a potential cut, but only make a cut at p if no previous cut has fallen on 
the path from the p to p. At each time < t < oo, this yields a forest of countably many rooted 
M-trees; we write Ft for the component of this forest containing p. Run to time infinity, this process 
again yields a countable collection of rooted M-trees, later called {fi,i £ loo)- Furthermore, each 
element fi of the collection comes equipped with a time index (the time at which it was cut). 

For < t < c», let L{t) = /i(7^)ds, and let L{oo) = limj^oo L{t). It turns out that L{oo) is 
almost surely finite. Next, create a single compact M-tree (T', d') from the collection {fi,i E loo) 
and the closed interval [0, L{oo)] by identifying the root of fi with the point L{Ti) e [0, L(oo)], for 
each i e loo, then taking the completion of the resulting object. Let /i' be the pushOforward of p 
under the transformation described above. 

Theorem 1.5. The triples (T', d' , p') and (T, rf, p) have the same distribution. Furthermore, S 
F' and L{oo) G F' are independent and both have law p' . 

Using the standard encoding of the CRT by a Brownian excursion, we may take the triple 
(T, d, p), together with the point p, to be encoded by a Brownian excursion. Similarly, it is possi- 
ble to view the triple (T', d' , p'), together with the points and L{oo), as encoded by a reflecting 
Brownian bridge; see Section 10 of [7] (this is also closely related to the "forest floor" picture of 
[12]). From this perspective, the transformation from (T, p) to {Q, 0, Loo) becomes a new, random 
transformation from Brownian excursion to reflecting Brownian bridge. When expressed in the lan- 
guage of Brownian excursions and bridges, this theorem and our "inverse transformation" result. 
Theorem 1.7, below, have intriguing similarities to results from Aldous and Pitman [7]; we briefly 
discuss this in Appendix A. 

As an immediate consequence of the above development, we will obtain the following result. 
Let p{t) be the mass of the tagged fragment in the Aldous-Pitman [7] fragmentation at time t. 

Theorem 1.6. The random variable p{t)dt has the standard Rayleigh distribution. 

This result is restated in Corollary 5.7, below. A different proof of this fact appears in a recent 
preprint by Abraham and Delmas [1]. 

We are also able to explicitly describe the inverse of the transformation of Theorem 1.5, and we 
now do so. Let (T, d, p) be a measured CRT, and let p, p' be independent random points in F with 
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law /i. Let B be the set of branch points of T on the path from pXo p' . For each 6 G i? let 7b the the 
set of points a; S T for which the path from 6 to p contains a point b' G B with d{p, b') > d{p, b). 
In words, T' is the set of points in subtrees that "branch off the path from p to p' after 5." Then, 
independently for each point b E B, let yi, be a random element of %, with law p/p{Tb)- Delete all 
non-branch points on the path between p and p'; then, for each b E B, identify the points b and yi,. 
Write (T', d') for the resulting tree, and p' for the push-forward of ptoT' ■ 

Theorem 1.7. The triples (T,d,p) and {T',d',p') have the same distribution. Furthermore, the 
point p' G T' has law p' . 

We remark that it is not a priori obvious the inverse transformation should a.s. yield a connected 
metric space, let alone what the distribution of the resulting space should be. Theorems 1.5 and 1.7 
together appear as Theorem 5.1, below. 

Plan of the paper. In Section 2 we gather definitions and state our notational conventions. In 
Section 3 we prove all finite distributional identities related to the case fc = 1, in particular proving 
Theorems 1.3 and 1.4, and in Section 4 we prove Theorem 1.1. Our results on cutting the CRT, 
notably Theorem 5.1, appear in Section 5; finally, in Section 6 we explain how our results straight- 
forwardly imply the distributional convergence results obtained in [23, 24, 38]. 

2 Notation and definitions 

We note that the terminology introduced in Sections 2.2 and 2.3 is not used until Section 5, and the 
reader may wish to correspondingly postpone their reading of these sections. 

2.1 Finite trees and graphs 

Given any finite graph G, we write v{G) for the set of vertices (or nodes) of G and e(G) for the set 
of edges of G, and write |G| for the size (number of vertices) of G. If we say that G is a graph on 
S we mean that v{G) = S. Given a graph G and w £ v{G), we write C{w, G) for the connected 
component of G containing w. Given a graph G and S C V{G), we sometimes write G \ 5 for the 
graph(i;(G),e(G)\5). 

Practically all graphs in this paper will be rooted trees and be denoted t or T. When we write 
"tree" we mean a rooted tree unless we explicitly say otherwise. 

Given a rooted labeled tree t, we write r{t) for the root of t. For a vertex u of i write t{u) for the 
subtree of t rooted at u, write ht{u) for the number of edges on the path from r{t) to u, and write 
a{u) — a{u, t) for the parent of u in t, with the convention that a(r(t)) = r{t). At times we view 
the edges of t as oriented towards r{t). In other words, if we state that {u, v) is an oriented edge of 
t, or write {u, v) E e{t), we mean that {u, v} E e{t) and v = a{u). In this case we call u the tail of 
{u, v} and v the head of {u, v}. It is also sometimes useful to view r{t) as both the head and tail of 
a directed loop (r(t), r(t)); we will mention this again when it arises. 

Given a set S" = {vi, . . . , v^} of nodes of t, we write tlS} or . . . , Ufc] for the subtree 
of t obtained by taking the union of all shortest paths between elements of S, and call ifS} the 
subtree oft spanned by S. Given a single node v E t, we write t^^" to denote the tree obtained 
from t by rerooting at v. As mentioned in the introduction, in this paper an ordered forest is a 
sequence of rooted labelled trees f = (ti, . . . , t^) with pairwise disjoint sets of labels. If we write 
i = {ti, . . . , ifc) is an ordered forest on S we mean that v{ti) U • • • U v{ti.) = S. 

Given a finite set S, by a uniform Cayley tree on S we mean a rooted tree chosen uniformly at 
random from among all rooted trees t on S; there are |5| ''^'^^ such trees. Given a rooted or unrooted 
tree t, and an ordered sequence S — {vi, . . . , Vk) of elements of v{t), we recall the definition of t{S) 
(the planting oft at S) from the introduction: for each 1 < i < k, create a new node Wi and add a 
single edge between Wi and Vi. Given a set [/ C v{t{S)), we write \U\ for the number of nodes of 
U \ {wi, . . . , Wk}- In other words, the nodes wi, . . . ,Wk are not included when performing node 
counts in t{S). 



4 



2.2 Metric spaces and real trees 

In this paper all metric spaces are assumed to be separable. Given a metric space X = {X,d), 
and a real number c > 0, we write cX for the metric space obtained by scaling all distances by c. 
In other words, if x,y E X then the distance between x and y in cX is cd{x,y). We also write 
diam (X) — sup{d(a;, y) : x,y <E X} e [0, oo\. 

Given a metric space {X, d) and x^y E X, a. geodesic between x and y is an isometry / : 
[0, d{x, y)] — )■ X such that /(O) = x and f{d{x, y)) — y. In this case we call the image Im(/) a 
shortest path between x and y. 

A metric space T = (T, d) is an M-tree if for all x,y E T the following two properties hold: 

1. There exists a unique geodesic between x and y. In other words, there exists a unique isometry 
/ : [0, d{x, y)] T such that /(O) ^ x and y)) = y. 

2. If g : [Q,d{x,y)] — > T is a continuous injective map with 17(0) = x and g{d{x,y)) = y then 
f{[0,d{x,y)])=g{[0,dix,y)]). 

Given an M-tree (T, d) and a,b E T, we write Ja. bj for the image of the unique geodesic from a to 
b, and write ]a, 6|= |a, 6] \ {a, 6}. The skeleton skel(T) is defined as 

U 

a.beT 

(We could equivalently define skel(T) as the set of points whose removal disconnects the space.) 
Since (T, d) is separable this may be re-written as a countable union, and so there is a unique cr-finite 
measure ^ on T with £{]a, b[) = d{a, b) for all a, 6 e T and such that £{T \ skel(T)) = 0. We refer 
to £ as the length measure on T. 

For a set 5 C T, write T{S) for the subspace of T spanned by Uaj.ygsja;, ?/| and ds for its 
distance, and note that {T{S) , ds) is again a real tree. 

2.3 Types of convergence 

Before proceeding to definitions, we remark that not all the terminology of this subsection is yet fully 
standardized. The Gromov-Hausdorff distance is by now well-established. The name "Gromov- 
Hausdorff-Prokhorov distance" seems to have first appeared in [42], Chapter 27, where it had 
a slightly different meaning from above. The probabilistic aspects of the Gromov-Hausdorff- 
Prokhorov distance were substantially developed in [20, 37]. In particular, it is shown in [37], 
Section 6.1, that the below definition of dcHP is equivalent to a definition based on the more stan- 
dard Prokhorov distance between measures. 

Gromov-Hausdorff distance 

Let (X, dx) and (Y, dy) be metric spaces. The Gromov-Hausdorff distance dcuiX, Y) between X 
and Y is defined as follows. Let S be the set of all pairs ((/>, tji), where (j) : X Z and 1/) : F — > Z 
are isometric embeddings into some common metric space (Z, dz). Then 

dcn{X,Y)= inf dn{cp{X),i^{Y)), 
(</.,V)e5 

where dn denotes Hausdorff distance in the target metric space. It can be verified that dew is indeed 
a distance, and that writing M for the set of isometry-equivalence classes of compact metric spaces, 
(A^, dcM} is itself a complete separable metric space. We say that a sequence c?n) of compact 
metric spaces converges to a compact metric space (X, d) if dcn{Xm X) — )■ as n 00. It is then 
obvious that X is uniquely determined up to isometry. There are two alternate descriptions of the 
Gromov-Hausdorff distance that will be useful and which we now describe. 
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Next, for metric spaces {X, dx) and {Y, dy), and a subset C of X xY, the distortion dis(C) is 
defined by 

dis{C)=snp{\d{x,x')-d{y, y')\ : {x,y) G C, {x' ,y') e C}. 

A correspondence C between X and F is a Borel subset of X xY such that for every x & X, 
there exists y €Y with (x, y) E C and vice versa. Write 't^{X, Y) for the set of correspondences 
between X and Y. Given C e '^{X, Y), we then have 

doniX, y) = ^ inf {r : 3C G '€{X, Y) such that dis(C) < r} , 

and there is a correspondence which achieves this infimum. 

Given a correspondence C between X and Y and e > write 

C, = {{x,y) eXxY: 3{x',y') e C,dx{x,x') < e,dY{y,y') < e} 

and note that is again a correspondence, with distortion at most dis(C) + 2e. We call the e 

blow-up of C. 

Let (X, dx, (a^i, ■ • ■ , x^)) and {Y, dY,{yi, ■ ■ ■ , yk)) be metric spaces, each with an ordered set 
of k distinguished points (we call such spaces k-pointed metric spaces). When fc = 1, we simply 
refer to pointed (rather than 1-pointed) metric spaces, and write {X, dx,x) rather than {X, dx, {x)). 

The k-pointed Gromov-Hausdorjf distance is defined as 

d%^{X, y) = \ inf {r : 3C e ^(X, Y) such that {xuyi) & C,l <i <k and dis(C) < r} . 

It is straightforward to verify that for each k, the space (A^*^, c^gh) marked isometry-equivalence 
classes of fc-pointed compact metric spaces, endowed with the distance dg^, forms a complete sep- 
arable metric space. 

Couplings and Gromov-Hausdorff-Prokhorov distance 

Let {X,d,p) and {X',d',^') be two measured metric spaces, and let tt be a Borel measure on 

X X X' . We say is a coupling between /i and /i' if < /i and p'^ i' < /i', where p : X x X' ^ X 
and p' : X X X' ^ X' are the canonical projections. The defect of v is defined as 

D{y) = m£uc((M - P.y){X), {p' - 

We let C{fi, be the set of couplings between p and p', and for e > we write Ce{p, p') = {tt e 
C{p,pi'):D{v)<e} 

The Prokhorov distance between two finite positive Borel measures p, p' on the same space 
(X, d) is 

d° (yu, p') = inf{e > : p{F) < p'{F^) + e and < /i(F') + e for every closed F C X} . 

There is another distance which generates the same topology and lends itself more naturally to 
combination with the correspondences introduced above. We define 

dp{p, p') = inf {e > : 3i/ e Ce{p, p') , I'iHx, x') € X x X \ d{x, x') > e}) < e} . 

The Gromov-Hausdorff-Prokhorov (GHP) distance between X = {X, d, p) and X' = {X' , d',p') 
is defined as 

H (Y Y'\-ir,ff.^n- 3!/eCe(/i,M')andi?e^^'(X.X')suchthat 1 

We always have dGHp(X, X') > dGp(X, X'). Similarly to before, the collection M of measured 
isometry-equivalence classes of compact metric spaces, endowed with the distance dcup, forms a 
complete separable metric space. 
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Given fc-pointed measured metric spaces X = {X, dx , H, {xi, . . . ,Xk)) and X' = {X' , d', /i', (x'l, . . . , x'^,)), 
we define the k-pointed Gromov-Hausdorjf-Prokhorov distance as 

Y'^ - iv^f n ■ e C,(/x,/i') and i?e'ir(X,X') such that \ 

doHplA, A j - mt |e > U . ^ ^.^^^^ ^ and (x„ e i?, 1 < i < fc J ' 

Once again, we may define an associated complete separable metric space (A^*^, rfcHp)- 

3 Cutting down uniform Cayley trees 

3.1 The Aldous-Broder dynamics 

Given a simple random walk {Xn}nen on a finite connected graph G, we may generate a spanning 
tree T of G by including all edges {Xk,Xk^i) with the property that X/j+i ^ {-'^^i}o<j<A;- The 
resulting tree T is in fact almost surely a uniformly random spanning tree of G. (More generally, 
if G comes equipped with edge weights {Wf. : e G e{G)}, then the probability the simple random 
walk on the weighted graph G generates a specific spanning tree t is proportional to neee(t) ^e-) 
This fact was independently discovered by Broder [14] and Aldous [2], and the above procedure is 
commonly called the Aldous-Broder algorithm. 

By reversibility, the tree T generated by the Aldous-Broder algorithm may be instead be viewed 
as generated by a simple random walk {X„}„<o on G, started from stationarity at time —oo (see 
[32], pages 127-128). If instead of stopping at time the walk at time zero we instead stop at time 
i > 0, then the walk {Xn]n<i gives another tree, say T^. What we call the Aldous-Broder dynamics 
is the (deterministic) rule by which the sequence {Ti ,i > 0} is obtained from Tq and from the 
sequence {X„,n > 0}. In the current section, we explain these dynamics. In the next section, 
we introduce a modification of the Aldous-Broder dynamics, and use it to exhibit the key coupling 
alluded to in Section 1. 

Recall that given a rooted tree t and x E v{t), t{x) denotes the subtree of t rooted at x. Fix 
an integer n > 1 and a tree t on [n], and let x = (xi)i^f,] be a sequence of elements of [n] = 
{1,2,. 

We then form a sequence of trees {r™(i,x) : m e N}. First, T'^ = t. Then, for m > 0, we 
proceed as follows. 

• If a;„+i = r(T™) then = T™. 

• If Xm+i 7^ r{T"^) then form by removing the unique edge of T™ with tail then 
adding the edge {xm, x^+i), and finally rerooting at x^+i- 

In all cases, r{T™-) = Xm for all to > 1. We refer to this procedure as the Aldous-Broder dynamics 
on t and x. One can equivalently think of the root vertex as being both the head and tail of a directed 
loop; then one always removes the unique edge with tail x„j+i in T'" and adds the directed edge 
{xjn, Xm+i)- Taking this perspective, leti?„j_|_i = _R„i+i(t, x) be the subtree of T™ rooted at a;„i+i, 
so Rm+1 = T™{xm+i)- Let Kjn+i = Km+i{t, x) be the other component created when removing 
the edge with tail x^+i, which is empty if x„j+i = Xm and otherwise contains Xm+i- In all cases 
rpm+i jg obtained from Rm+i and Km+i by adding an edge from Xm to x^+i (see Figure 1). 

3.2 A modified Aldous-Broder dynamics 

Say that a sequence x e [n]^ is good if for each k e [n], sup{i : Xi = k} = oo. Fix a tree t on 
[n] and a good sequence x. We now describe a rule for removing a set of edges from t to obtain an 
ordered forest F — F{t, x) on [n]. (Recall that an ordered forest is an ordered sequence {ti, . . . , tk) 
of rooted trees.) 

In words, to build F(t,x) we start from the tree t and make the cuts that are dictated by the 
sequence x, but ignore any such cuts that fall in a subtree we have already pruned at an earlier step. 
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Figure 1. Two successive trees T' and T'"*"^ built from the sequence construction: T'+i is obtained from 
by cutting above Xi+i and rearranging the parts in such a way that the subtree above the cut is appended as a 
child of the root Xi+i of the subtree Ri+i below the cut. 

t F(t,x) f{t,x) 



^5 X\ X2 Xi X5 Xi X2 Xi X5 




Figure 2. Left: a tree t, with node labels suppressed for readability; the first five nodes xi, . . . ,X5 of some 
good sequence are marked in the figure. Center: the forest F(t, x) built by applying the modified Aldous- 
Broder dynamics to t with any sequence x starting with xi, . . . , X5. The trees are Ti , x) , . . . , r4 (t , x) are 
shown from left to right, and ri = Xi,r2 = X2,r-i = Xi, ri = 15. Right: the tree T(t, x), which has root xi. 



Since x is good, we will eventually prune the root r{t) and so we will ignore all but finitely many of 
the cuts. 

Formally, let o-q = and, for i > 1, let 

r .-1 I 

(Ti = inf < m > cri_i : Xm ^ [J t{xa,) > • 

Then let k = n{t, x) = inf{i : x^. ~ r{t)}. Note that we always have <Ji = 1, that n < 00 since x 
is good, and that for all j > k, aj — 00. Recall that we write t — {v{t), e{t)), where v{t) and e{t) 
denote the vertex and edge set of t, respectively. After all the cuts in x have been made, we are left 
with a graph 

/ = {v{t),e{t) \ {(x,,,a(x,J), l<i< k}). 

For 1 < i < K, let Ti = x) = C{xa. , /). Note that Ti is a tree, which we view as rooted at 
Xa- . We then take 

F = F(t,x) = (ri,...,r,). 

Write ri = ri{t, x) for the root of Ti and note that r„ — r{t). Finally, write T ~ T{t, x) for the tree 
obtained from the forest F(t, x) by adding a directed edge from the root of T^+i to the root of Ti, for 
each i E [k — 1], and rooted at ri (as suggested by the orientation of the edges). These definitions 
are illustrated in Figure 2. We call this procedure the modified Aldous-B rode r dynamics on t and x. 

Remark. The cutting procedure described above differs slightly from that used in much of the work 
on the subject. More precisely, it is more common to cut the tree by the removal of random edges 
rather than the selection of random vertices. However, there is a close correspondence between the 
vertex selection procedure and the edge selection procedure on a planted version of the same tree. 
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which means results proved for one procedure have immediate analogues for the other. In particular, 
Janson ([23], Lemma 6.1) analyzed the difference between the two variants and showed that it is 
asymptotically negligible. 

Now let X ~ {Xm)jn(zfi be a sequence of i.i.d. uniform {1, . . . , n} random variables. It is easily 
seen that X is good with probability one. The following theorem is the key fact underlying almost 
all the results of the paper. 

Theorem 3.1. Let T be a uniform Cayley tree on [n]. Then for any tree t on [n] and any w € [n], 

P(f (T, X) = t and r{T) ^ w) = n"". 

Since there are n"^^ labelled rooted trees on [n], there are possible ways to choose a labelled 
rooted tree on [n], plus an additional vertex of said tree. In other words, the theorem states that 
T(T, X) is a uniform Cayley tree, and that r{T) is uniform on [a] and independent of T(T, X) (and 
that r{T) is uniform on [n], but this is immediate from the fact that T is a uniform Cayley tree). 

Proof of Theorem 3.1. We proceed by induction on n, the case n = 1 being trivial. So we now 
suppose that n > 1. First, consider the case when w — r{T); we have r{T) = r{T) precisely if 
Xi = r{T) and in this case T = T. Thus, for any rooted tree t on [n], 

P(f = t, r{T) = r(f )) = P(Xi - r(T), T ^ t) = -V{T = t) = ^, 

since T is a uniform Cayley tree. 

Next, fix a rooted tree t on [n] and any w G [n], w ^ r{t). Let c = c{t, w) be the child of 
r = r{t) for which the subtree of t rooted at c contains the node w. Let tr and tc be the subtrees 
containing r and c, respectively, when the edge (c, r) is removed from t. If we are to have r{T) = w 
and T = t then tr must appear as a subtree of T, and we must additionally have Xi ~ r. Since T is 
a uniform Cayley tree it follows that 

P(r(r) ^w,f^t) 

= P(r(T) = f = <, is a subtree of T, Xi = r) 

(n - \t h""!*--! 1 

= -5^ ^-^^ PMT) ^w,T^t\ is a subtree of T, Xi = r). (1) 

Now let X' = (X-)ieN be the subsequence of X consisting of the nodes of Ki{t, X), the connected 
component of T containing the root after the edge above Xi has been removed: For i e N, let 

j,^mm{e:\{X,,...,X,}nviK,it,X))\=i}, 

and set X'^ = Xj. . Given that tr is a subtree of T and Xi = r, the entries of X' are independent, 
uniformly random elements of v{tc)- Furthermore, under this conditioning we have that T(T, X) = 
t and r(T) = w precisely if f{Ki{T, X),X') = t^ and r{Ki{T, X)) = w. Since T is a uniform 
Cayley tree and Ki {T,X) is obtained from T by removing the subtree rooted at Xi, it is immediate 
that conditional on its vertex set, Ki (T, X) is again a uniform Cayley tree (and has less vertices than 
T). By induction, it follows that 

P(r(T) = w, f = t I t,. is a subtree of T, Xi = r) 

= P{f{KiiT,X),X') ^tc,r{Ki{T,X)) ^w\tr is a subtree of T, Xi = r) 

Since \tc\ = n - \trl together with (1) this yields that P(f (T, X) = t and r(T) = w) = n"", as 
required. □ 
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We can transform the modified Aldous-Broder procedure for isolating the root into an edge- 
removal procedure, as follows. First, plant the tree to be cut at its root. Next, each time a node is 
selected for pruning, instead remove the parent edge incident to each selected vertex. The Aldous- 
Broder procedure then becomes the planted cutting procedure described in the introduction, and 
k(T, X) is precisely the number of edges removed before the planted vertex is isolated. But k(T, X) 
is also the number of vertices on the path from r{T) to r{T) in T. By Theorem 3.1, and from known 
results about the distance between the root and a uniformly random node in a uniform Cayley tree 
[3, 5, 6, 27, 36], the case fc = 1 of Theorem 1.1 and of Corollary 1.2 follow immediately. By a 
well-known bijective correspondence between labelled rooted trees with a distinguished vertex and 
ordered labelled rooted forests [see, e.g., 7], Theorem 1.3 also follows immediately. 

Remark. Aldous [4] studied the subtree rooted at a uniformly random node in a critical, finite 
variance Galton-Watson tree conditioned to have size n. In particular, he showed that such a subtree 
converges in distribution to an unconditioned critical Galton-Watson tree. It is then straightforward 
that, for fixed fc > 1, the first k trees that are cut converge in distribution to a forest of k critical 
Galton-Watson trees. On the other hand, a critical Galton-Watson tree conditioned to be large 
converges locally (in the sense of local weak convergence of [9], i.e., inside balls of arbitrary fixed 
radius k around the root) to an infinite path of nodes having a size-biased number of children (exactly 
one of which is again on the infinite path), with each non-path node the root of an unconditioned 
critical Galton-Watson tree. This is the incipient infinite cluster for critical, finite variance Galton- 
Watson trees [26]. Theorem 1.3 then appears as a strengthening of this picture, vaUd only for Poisson 
Galton-Watson trees, in which k is allowed to grow with n. 

Recall that T is a uniform Cayley tree on [n] and that X = {X„i)„ien is a sequence of i.i.d. 
uniform elements of [n]. In the next proposition, which is essentially a time-reversed version of 
Theorem 3.1, we write F(r, A") = F for readability. 

Proposition 3.2. For any forest f — {ti, . . . ,tk) on [n], given that F = f , independently for each 
i G [k — 1] the parent a{r{ti), T) ofr{ti) in T is a uniformly random element ofUj^^^iv{tj). 

Proof. If A: = 1 then there is nothing to prove. If fc > 1 then fix any sequence v = {vi, . . . , 
with Vi £ [Jj^^^iv{tj) for each i E [fc — 1]. Write t{i, v) for the tree formed from f by adding an 
edge from r{ti) to u^+i for each i e [fc — 1]. In order that F = f and that, for each i e [fc — 1], 
a{r{ti), T) = Ui+i, it is necessary and sufficient that T = t(f , v) and that for each i e [fc], X^. = 
r{ti). The probability that T = i(f , v) is n~^"~^'. Furthermore, since {X„i)men are i.i.d. elements 
of [n], 

V{X,^^r{U),\<i<k\T = t{i,w))= n III \, ^i - 

It follows that 

P (F = f and a{r{U),T) ^ 1 < ^ < fc) = -L^ . ^ ^j";^^' 

which proves the proposition since this expression does not depend on wi , . . . , Wfe. □ 
Theorem 1.4 is an immediate consequence of Proposition 3.2. 



4 Isolating more than one vertex 

In this section we describe how to generalize the arguments of Section 3.2 to obtain results on 
isolating sets of vertices of size greater than one. Recall that when performing the planted cutting 
of S in t, described in Section 1, we wrote W — {wi, . . . ,Wk] for the set of new vertices, and 
wrote M ~ M{t, S) for the (random) total number of edges removed. In order to study the random 
variable AI, it turns out to be necessary to study a transformation of the planted cutting procedure. 
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The modified procedure is defined via a canonical re-ordering of the sequence of removed edges. 
As such, it may be coupled with the original procedure so that the final set of removed edges is the 
same in both. In particular, both procedures isolate the vertices of W, and the total number of cuts 
has the same distribution in both. 

In the following, for an edge e and a connected component C, we write e e C to mean that both 
end points of e lie in C, or equivalently (since the connected components are trees) that the removal 
of e leaves C disconnected. Also, recall from Section 2 that given a set A of edges, we write t \ A 
for the graph {v{t),e{T) \ A). 

Now fix a sequence e = (ei, . . . , Cm) of distinct edges of t. We say that e is a possible cutting 
sequence (for S in t) if 

• each edge {vi, Wi}, 1 < i < k appears in e (e really isolates wi, . . . , Wk), and 

• for each 1 < j < m, one has ej E C{W, t \ {ei, . . . , ej_i}), that is, each indeed produces 
a cut. 

We now describe a canonical re-ordering of e, which we denote e* ; this re-ordering operation gives 
rise to the modified cutting procedure. In e*, we first list all edges whose removal decreases the 
size of the component containing wi (in increasing order of arrival time). We then list all remaining 
edges whose removal decreases the size of the component containing W2, again in increasing order 
of arrival time, and so on. (This is somewhat related to a size-biased reordering of an exchangeable 
random structure - see [40], Chapter 1.) The next three paragraphs formaUze this description. 
For 1 < i <k, write 

U^ = Ui{e) ^ {j : GC(w„t\{ei,...,ej_i})}, 

and let U* = Ui \ (Uj=i ^j)- words, U* is the set of times j at which the component containing 
Wi does not contain any of wi, . . . , Wi-i, and such that removing the current edge Cj decreases the 
size of this component. 

Next, let m(i) = m{i,t,e) — \Ui\, write Zi — Zi{e) = {zi_i, . . . ,Zi „i{i)) for the sequence 
obtained by listing the elements of Ui in increasing order, and define Z* accordingly. Notice that 
once Wi is in a component distinct from wi, . . . , it can never rejoin such a component, and so 
writing s{i) ~ s{i, t, e) = min{£ : Zi,i G U*}, we must have 

^i ; ^i,s(z) + l ! • ■ • ; ^i,m(i) ) ■ 

We then write 

^ ~ (^Zl,s(l) ) ■ ■ ■ ) ^Zl,m(l) ' ^22.3(2) 1 ' " ' J 6^2, m(2) > " ' ' > Czfc,s(fc) ) ' " ' 1 ^Zk^m{k) ) 

(^1 J ■ ■ • J ^m) 1 

the latter equality constituting the definition of e^, . . . , ej„. For 1 < i < fc, let ai{t, e*) = 1 + 
- s(^) + 1) let h{t,e*) = ELi(^"(^) - + 1)' and set 

= {e*j,ai <j< bi) = {ez^ .^,s(i) <j< m{i)) . 

We remark that necessarily e^. ^^.^ = {wi, Vi}, and so in particular the sequence e* is non-empty 
for each 1 < i < fc. 

Now write E ~ E(i, S) ~ {Ei, . . . , Em) for the random sequence of removed edges (in the 
original planted cutting procedure), write E* = E* {t, S) = {E^ , . . . , E^j) for the rearrangement of 
E described above, and likewise define E*, for 1 < i < fc, as above. 

It is easily seen that if e is not a possible cutting sequence then P (E(i, S) = e) = 0, and if e is 
a possible cutting sequence then 

m ^ 

P(Eft,5)^e)^n I fnnA/,\ S ^2) 

fj{ \e[C{W,t\{ei,...,ej^i\))\ 
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For our purposes, it is in fact the expression for P (E*(t, S) = e*) given in the following lemma 
that will be more useful. Fix any f = (/i, . . . of edges of f(S'). If there exists a possible cutting 
sequence e = (ei , . . . , e^) for 5 = {vi, . . . ,Vk) int such that e* = f , then we say that f is valid 
(for t and S). 

Lemma 4.1. Given any sequence f = (/i, . . . , /„) that is valid for t and S, we have 

Proof. We prove the lemma by induction on \e{t{S))\. Fix f as in the statement of the lemma, write 

£{f) = £{f, t,S) = {e : e is a possible cutting sequence for 5 in t and e* = /} , 
and note that f e ^ (f ). For any e = (ei, . . . , e^) e f we necessarily have ei = /i, and so 

P {-Elit, S) = /i) = P (Ei(t, S) = A) = 

If ei = {ui, wi} then, writing S' = {v2, ■ ■ ■ ,Vk), we have 

P (E* = f I E*i = /i) = P (E* = f I El = /i) 

= P(E*(i,S') = (/2,...,/™)) , 

and the result follows by induction since t{S') has fewer edges than t{S). 

If ei 7^ then write ti = C{wi,t{S) \ {ei}) and write t2 for the other component 

of t{S) \ {ei}; each of these trees has fewer edges than t{S). Write 5i = (xi, . . . ,Xki) and 
5*2 = (yi ) • • • , ?/fe2 ) for the nodes of S within ti and t2, respectively, listed in the same order as in S. 

Now fix any possible cutting sequence e = (ei, . . . , e^) with ei = /i. Write e^^' and e^^' for 
those edges in the sequence (e2, . . . , 6^) falling in ti and t2, respectively, and listed in the same 
order as in e. Then it is clear that, conditionally on Ei = /i, the sequences E(ti, 5*1) and E(i2, 52) 
have the distribution of the planted cutting procedure on ti{Si) and t2{S2), respectively, and are 
independent. In other words, 

P (E(ii, 5i) = e(i), E(i2, S2) = e(2) | Ei = /i) =P(E(ti, 5i) = e^'^) ■ P(E(t2, ^2) = e^^)) . 

Furthermore, if e e £:(f) if and only if e^^) G S{i'^^\ti, Si) and e^^) g £{f(^\t2, S2). (Note: 
this does not mean that the map from e to (e^^^ e^^') is bijective! In fact, for a given pair e^^' G 
£{f^^\ti,Si) and e^^) e £{f'^^\t2, S2), the number of pre-images in 5(f) is precisely C^"^), 

where mi is the length of t^^\) Also, f (resp. f ^^^) is valid for ti and Si (resp. for t2 and S2). It 
follows that 

P (E* = f I El = /i) 
= ^P(E = e|Ei=/i) 

= E E P(E(ti,5i) = e«,E(t2,52)=e(2)|Ei=/i) 

e(i) e£(f (1) ,ti ,Si) e(2) £f (f (2) ,t2,S2) 

E E P(E(ii,5i) = e«)-P(E(i2,52)=e(^)) 

e(i)ef:(f(i),ti,Si)e(2)ef (f(2),i2,S2) 

= P(E*(ti,5i)=f(l))-P(E*(t2,52)=f(")) , 

from which the result again follows by induction. □ 
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The formula in the preceding lemma implies that removing edges in the order given by E* 
corresponds to the following procedure. For each 1 < i < fc, in that order, remove edges of t 
uniformly at random from among those whose removal reduces the size of the component currently 
containing w, , until Wi is isolated. We call this the ordered cutting of S in t. 

For 1 < i < fc, write Mi for the random time at which Wi is isolated in the ordered cutting 
procedure: 

= Nh{t, S) = max{j : E* e C[w,,t \{E{,..., E*_^})} 
= min{j : \C{w,,t\{El, . . . , E*})\ ^ Q}, 

and note that Mi < M2 < ■ ■ ■ < Mk = M. 

Now, let T be a uniform Cay ley tree on [n], let Vi, . . . ,Vk be independent, uniformly random 
elements of [n], and let Sk — {Vi, . . . , Vk). Then write Mk = M{T, Sk) for the number of edges 
removed during the ordered cutting of Sk in t. 

Theorem 4.2. Mk — k is distributed as the number of edges spanned by the root plus k independent, 
uniformly random nodes in a uniform Cayley tree of size n. 

Theorem 1.1 follows immediately from Theorem 4.2 and the relationship between planted cut- 
ting and ordered cutting described above. To prove Theorem 4.2, we will exhibit a coupling which 
generalizes that of Section 3.2 and which we now explain. The coupling hinges upon the following, 
easy lemma, whose proof is omitted. Recall that if 5 is a set of nodes in a tree t, then t\S'\ is the 
subtree of S spanned by 5*. 

Lemma 4.3. Fix i > 1. Let T be a uniform Cayley tree on [n], let Vi, . . . , Vi-^i be independent, 
uniformly random elements of [n], and let S — {r(T), T^i, . . . , Vi\. Let U be the most recent 
ancestor ofVi^i in T which is an element of v{T\S\). Let R be the set of nodes whose path to 
Vi+i uses no edges ofT\S\ (such paths may pass through U). Let — TfRl and let = 
T|([n] \ i?) U Then conditional on R, is a uniformly random labelled rooted tree on R, 

independent ofT^ and ofVi, . . . ,Vi, and Vi^i is a uniformly random element of R independent of 
T+ , , and Vi, . . . ,Vi. 

The definitions in Lemma 4.3 are depicted in Figure 3. 



r{T) 




Figure 3. An example of the definitions of Lemma 4.3 in the case i — 2 (so S = {r{T), Vi, V2)). The subtree 
rJS] is in thicker black lines. The tree is in thick grey lines, and the tree consists of all black lines 
(thick and thin). 
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Figure 4. Left: the tree r((V^i)). Center: The tree r''"^i, planted at Vi. Right: the tree Ti. The vertex and 
edge labels provide an example of the construction in the proof of Theorem 4.2, in the case k = 1. For each 
of the three trees, the forest obtained by removing the bold edges (and, for r{(Vi)), then throwing away the 
vertex wi) is identical. 

Proof of Theorem 4.2. We shall provide a coupling between the random sequence of edges E*(T, {Vi, . . . , V^)) 
and a sequence Ti, . . . ,Tk of trees on [n], such that the following properties hold. First, for any 
rooted tree t on [n], and any vi, . . . ,Vi elements of [n] (not necessarily distinct), 

P {T, = t,Vi^v,,...,V,^ V,) = (3) 

Second, for each 1 < i < k, the following holds: 

(★) the forest obtained from T{{Vi, . . . ,Vi)) by first removing all edges of {E^, . . . , Elj,}, then 
deleting wi, . . . , Wi, is identical to the forest obtained from Ti by removing all edges of its 
subtree T,Ir(r,),yi,...,y,l. 

Equation (3) says that Ti is a uniform Cayley tree and l^i, . . . , are independent of Ti, and (★) then 
implies in particular (by considering only the case i = k) that Mk — k is equal to the number of 
edges of Tklr{Tk), Vi, . . . , Vkj. This clearly implies the theorem, and so it remains to explain how 
we construct such a sequence. 

Fix a sequence X = {Xi)i>i of i.i.d. uniform elements of [n]. Let Ti be the tree built by running 
the modified Aldous-Broder dynamics on T'^^^'^ (recall that this is the tree T, rerooted at node Vi) 
with the sequence (X,),>i. (In the notation of Section 3.2, Ti = f{T''^^^ , X).) By Theorem 3.1, 
for any tree t on [n] and any v E [n], P {Ti = t,Vi = v) = n^", so (3) holds in the case i = 1. 
Temporarily write ui, . . . ,ug for the nodes on the path in Ti from r(Ti) to Vi, in the same order 
they appear on that path. We must then have = Vi, and Mi = £. For 1 < j < £ — 1, let 
E* = {uj , a{uj , T*"^^! )}, and note that this is also an edge of T since T and T""^^^ have the same 
edge set. Then let E'^.j^ = {u£,wi} = {Vi,wi}. (An example of this construction is shown in 
Figure 4.) By construction, it is immediate that (*) then holds in the case i = 1. 

Now fix 1 < j < fc, suppose that Ti, . . . ,Tj and {E^ , . . . , E^j, ) are already defined and that (3) 
and (★) both hold for each 1 < i < j. As defined, Vj+i is independent of Tj and of {E^ , E^.j, ), 
and so for any tree t on [n] and any sequence ui, . . . , Uj+i of elements of [n], we have 

P{Tj=t,V,=u,,..., Vj+, = = n-("-i+^+i). (4) 

Let U be the most recent ancestor of Vj+i that lies in Tj lr{Ti), Vi, . . . , Vj}, and define T+ and T^ 
as in Lemma 4.3. 

Now let X' be a random sequence such that conditional on w(T+), the entries of X' are inde- 
pendent uniform elements of u(T+), independent of all preceding randomness. Then, apply the 
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modified Aldous-Broder dynamics to 7^+ '"^^3+1^ and call the result T* . By Theorem 3.1, given 
that v{T+) = S, (r+, Vj+i) and {T*,Vj+i) are identically distributed. As above, let ui,.. .,Ui 
be the nodes on the path from r{T*) to V,+i, and note that we must have M^+i = Mj + i. For 
1 < i <£~l\tiElj^^. = {Mi,a(Mi,r+'''^^^+i)}, andletS*^^.^^ = {V,+i,Wj+i}. In words, we 
have applied exactly the same construction as in the case i = 1, but to the subtree r+ of T (which 
contains V^+i). Figures 3 and 4 may be useful as visual aids to these definitions. 

Write P for the parent of U in Tj, and Ci, . . . , for the children of U in Tj \ (any such 
child is an ancestor of at least one of Vi, . . . , V,). Now let Tj+i be the tree obtained from Tj by 
replacing by T* . In other words, Tj^i is built from Tj by, first, removing all edges of Tj that 
are incident to nodes of T+; then, second, adding all edges of T* as well as edges from the root of 
T* to P and to each of Ci, . . . , Cf. With this construction, (*) now holds for all 1 < i < j + 1. 

Finally, write R = v{T+). By Lemma 4.3 and by Theorem 3.1, {T+,Vj+i) and {T*,Vj+i) 
are identically distributed conditional on their vertex sets, and both are independent of T^ and of 
Vi, . . . ,Vj. It follows that (4) still holds with Tj replaced by T^+i, and this verifies (3) and completes 
the proof by induction. □ 



5 A novel transformation of the Brownian CRT 

In [24], Janson suggested that it should be possible to define a version of the cutting procedure 
directly on T- In this section, we provide such a construction. This construction yields straightfor- 
ward, "conceptual" proofs of some of the main results of [24], and also provides a novel, reversible 
transformation from T to another, doubly-rooted Brownian CRT. (We remark in passing that the 
results of this section can also be straightforwardly used to prove the first convergence result from 
Theorem 1.10 of [24].) Using the by now well-known coding of the Brownian CRT by a standard 
Brownian excursion, this transformation can be viewed as a new, invertible random transformation 
between Brownian excursion and Brownian Bridge. 

We now describe the details of the construction, using the language of M-trees. For the inter- 
ested reader, we describe the corresponding transformation from Brownian excursion to reflecting 
Brownian bridge, in Appendix A. 

We begin with a quick, high-level description of the transformation. An initial compact real 
tree T distributed as the Brownian CRT will be cut by points falling on its skeleton. When a point 
arrives, the current tree is separated into two connected components; the one containing the root 
will suffer further cuts at later times, while the other one - the pruned tree - will no longer be cut. 
As in the discrete transformation of Section 3.2, the cut trees are rearranged by attaching thek roots 
to a "backbone" so as to form a new real tree. We now describe the continuous transformation by 
first building the backbone that will eventually connect the roots of the pruned subtrees, and then 
specifying where these subtrees should be grafted along the backbone. 

5.1 The details of the transformation 

Let V he a Poisson process on skel(T) x [0, 00) with intensity measure £ (g) dt, and for each t > 0, 
let 

V° ^{xeT : 3s, 0<s<t, (x, s) e V}. 

In [8], Aldous and Pitman used the point process V to construct (what is now called) a self-similar 
fragmentation process on T [10]. For each t > 0, let ^ T \ Vt . In particular, two points 
u^v e T \Vi are in the same component of precisely if, in T, the path \u, v\ contains no 
element of . Aldous and Pitman [8] established many beautiful facts about how the collection 
of masses of the components of 7^^° evolve with t; one basic fact from [8] is that that a.s., for each 
i > 0, has only countably many components, and the total mass of all components of is one. 
(This seems intuitively obvious, but note that it is a priori possible that for every t > {), F^ contains 
uncountably many components, each of mass zero.) 
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Description of the backbone. For t > 0, write % for the component of containing the 
root at time t; then define a process {L{t), t > 0) by setting 

L{t) = / ^i{fs)ds. (5) 
Jo 

The process L{t) is the continuum analogue of the "number of cuts by time t"; the process {L{t) , t > 
0) will code the distance along the backbone in the continuum transformation. 

Theorem 6 of [8] states that if we define an increasing function {X{t),t > 0) by 

(M7iM>0)=^^, (6) 

then X{-) is a stable subordinator of index 1/2, or in other words is distributed as the inverse local 
time process at zero of a standard reflecting Brownian motion. The function X{-) has almost sure 
quadratic growth, and it follows that L{oo) :— limf^oo L{t) is almost surely finite. (The proof of 
Corollary 5.6, below, contains a different proof that L(oo) is almost surely finite, using the principle 
of accompanying laws). 

The pruned subtrees, and their grafting on the backbone. Since "P is a countable set, 
we may enumerate its atoms as ((p^, Tj), i G N). For t > 0, let 

= {i e N : < T, < t, ^l{fr,) < M(tr,-)}, 

and let 

Let Poo = liiiit^oo Vt and and let /qo — limt_j.oo It- Next, for < t < oo, let Ft —T \ Vt, let dt 
be its intrinsic distance: for points .t, y in the same component of Ft, we have dt{x, y) — d{x, y), 
while for x, y in distinct components of Ft, we have dt{x, y) — oo.^ , and let fit be the restriction 
of fi to Ft- Then let {Ft, dt) be the metric space completion of {Ft, dt), and let fit be the extension 
of fit obtained by assigning measure zero to all points of Ft\Ft', note that there are only countably 
many such points^. 

Next, write Tt for the component of Ft containing p. We then have that a.s. for all t > 0, 7* is a 
connected component of Ft, and that a.s. 

{f^{ft),t>0)^ifit{Tt),t>0). (7) 

By definition, a.s. for every < s < t, every component of Fg not containing p is also a component 
of Ft- This naturally extends to the completions Fs and Ft- 

For < i < oo, let (j>t be the identity map from Ft to T, and let (pt be the unique extension of (pt 
to Ft whose restriction to any component of Ft is a continuous function. With probability one, for 
each i E I,pi has degree two in T and also in F^ - - It follows that almost surely, for each e /qo , 
Jv- \ J-V,_ contains precisely two points. Call these points Xi and yi, labelled so that Xi ^ T5-. and 
T/i G Tt^- Write fi for the component of J>. containing Xi. Necessarily, Xi G fi\Ft and p.; = (t){xi) 
is the closest point of 4>{fi) to p; in other words, pi is "the root of the subtree cut at time t^". Also, 
Xi and yi are both leaves in F^ - For distinct points pi , pj G /* the trees fi , fj are disjoint, so in 
particular Xi ^ xj. 

The space {Foo, doo, Poo) is the limiting analogue of the forest F from Section 3.2. We note that 
{T, p) can be recovered from {Foo,doo, Poo) by identifying Xi and yi for each pi G loo, and taking 
as measure the corresponding push-forward of poo- 

'See [15], Sections 2.3 and 2.4, for the general definition of intrinsic distance for a subset of a metric space. 

^The assiduous reader may ask: the forest {Tt, dt, ^t) is meant to be a random element of what (Polish) space? One 
possible answer is to view this forest as given by some random function et ■ [0,1] — > [0, oo) with et (0) = et (1) = 0, and 
with the "components" of the forest separated by the zeros of et ; this perspective is elaborated in Appendix A. However, this 
forest itself is essentially introduced for expository purposes and plays no role in the sequel; as such, the details of how to 
formalize the definition of {J-'t , dt , fit) are unimportant in the remainder of the paper. 
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For < t < oo, let At be the real tree consisting of the Une segment [0, L{t)] with the standard 
distance. Then form a measured M-tree {7t,dt,, jit) from At and J^t \ Tt, by identifying G fi 
and L{Ti) e [0, L(/;)], for each i E It, with measure fit given by the push-forward of fit\j^^\Tt- 
(We justify that {7t,dt, fit) is indeed a well-defined random R-tree, using a coding by excursions, in 
Appendix A.) We naturally view these spaces as increasing in t. Write T = Too, d = d^o, fi = fioo, 
and let u = L{Q) and v = L{oo). Almost surely both u and v are elements of T. 

The set of points of \u, v\ of degree greater than two in (T, d) are precisely the images in T of 
the points {xi,i e loo} in Too, and if a; is the image of such a point Xi then d{u, x) — L{Ti). It 
follows that the set of times {tj, i G /oo} is measurable with respect to (T, d). Also, a.s. {yi, i E 
loo} n {xi, i g loo} = 0, so we may view the points {yi, i e /qo} as points of T. 

Now recall the definition of (T, d, fi, p) from the start of the section, and let p' be a point of T 
selected according to p and independent of p. 

Theorem 5.1. It holds that (7~, d, fi, {u, v)) has the same distribution as {T , d, p, (p, p')). Further- 
more, conditional on {T , d, fi, (u, v)), the elements of {yi, i € loo} <^re mutually independent, and 
for all i G /qo, Vi is distributed according to the probability measure fl\-J-^^J-^ /(I ^ fj-{Tt))- 

We remark that Theorem 1.5 is an immediate consequence of the first assertion of the theorem. 
Likewise, Theorem 1 .7 immediately follows from the definitions of (T, d, fi, {u,v)) and of the points 
{yi, i G loo} and from the second assertion of the theorem. 

The remainder of Section 5 is devoted to the proof of Theorem 5.1. The proof of Theorem 5.1 
relies on couplings with the construction for uniform Cayley trees, and we introduce these cou- 
plings in Section 5.2. In Section 5.3, we show that the process {L{t),t > 0) is indeed the correct 
analogue of "number of cuts" in the discrete setting. Finally, we wrap up the proof of Theorem 5. 1 
in Section 5.4. 

5.2 Some couplings between discrete and continuous trees 

The couplings we introduce in this section are not specific to the case of uniform Cayley trees. 
This will be important in Section 6, when we extend our results to other finite-variance critical 
conditioned Galton- Watson trees. 

Let ^ = {S^i,i > 0) be a critical finite-variance offspring distribution, that is, a probability 
distribution on {0, 1, . . .} with 

y^J^i and ^ i{i - 1)^^ G (0, oo). 

i>0 i>0 

In the following, we consider only values of n such that a sum of n i.i.d. random variables with 
distribution ^ equals n — 1 with positive probability. For such n > 1, let T" be a Galton- Watson 
tree with offspring distribution ^, conditioned to have n nodes. For x,y E T" let y) be an^^^^ 
times the graph distance between x and y in T" . Let denote the root of T", let /i" be the measure 
placing mass 1/n on each node of T", and let £" be the measure placing mass an~^/^ on each 
vertex of T" (the "discrete, rescaled length measure"). Let next, T be the Brownian CRT with root 
p and distance metric d, let p be its mass measure, and let £ be the length measure on the skeleton 
of T. We will use the following fundamental result heavily. 

Theorem 5.2 (Aldous [6], Le Gall [31]). It holds that 

{T\d\p\pnA{T,d,p,p) 
as n ^ oo, where convergence is in the 1 -pointed Gromov-Hausdorff-Prokhorov sense. 

First, by Skorohod's representation theorem (see, e.g., [13]), we may consider a probability 
space (il, F, P) in which we have the almost sure GHP convergence 

(T",d",A^",p")^(r,d,M,p). 
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In such a space, we may find a sequence of correspondences (i?„,n > 1) between T" and T, 
such that dis(i?„) almost surely as n — > oo. We may also find a sequence of couplings 
{un, n > 1) between /i" and fi such that the defect D{vn) almost surely as n — > oo, and such 
that VniRn) ~^ almost surely as n oo. 

Next, let {si,i > 1) be a random sequence of independent points of T distributed according 
to /i, and for each n e N let (s", i > 1) be a sequence of independent points of T" distributed 
according to fi" . Also, write sq = p and Sq — p" for notational convenience, and for fc > 1 write 
SJ^ = {sq , . . . , sj!}. The almost sure GHP convergence above implies that for each fixed fc > 1, 

(T", d", m", (4\ • • ■ , 4)) 4 (T, d, /i, (so, . . . , Sk)) , 

in the sense of d^^p, and Skorohod's theorem (applied once for each fc > 1) then implies that we 
may work in a space in which almost surely, for all e > 0, 

lim inf {fc : d^+i ((T", {s^, . . . , s^)), (T, /i, (sq, . . . , Sfc))) > e} = cx). (8) 

71— J-OO 

For each n, fc > 0, recall that T^JS*^] is the subtree of T" spanned by and let £1 be the 
restriction of to r"|S'^']. Also, let TlSkj be the subti-ee of T spanned by Sk = {sq, . . . , Sfc}, 
and let £k be the length measure on TlSfc]. In the space in which (8) almost surely holds, we 
immediately have 

sup lim 4+i((rl5a,^L(so,---,5fe)),(mL4,(so,...,s,))) = 0. (9) 

For each n let 7"' be a Poisson process on T" x [0, oo) with intensity measure £" (g) dt. Then T'" 
converges in distribution to V in the sense of uniform convergence on compacts. 

Recall that we have enumerated the atoms of V as {{pi, Ti),i € N); likewise, for each n G N we 
list the atoms of V"' as {{pf, r"). i £ N). We noted above that a.s. for each i e N, pi has degree two 
in T and in J>;-. Since T is compact, yet another application of Skorohod's theorem then implies 
that we may find a space in which in addition to (8) and (9), almost surely for each e > we have 

lim inf{i : |rf -T,| > e} = oo, (10) 

n— ^OO 

for each A; > we have 

lim inf {z e N : in^^l n {pi, . . . ^ |T"[5a n W, . . . ,^^1} = oo, (11) 
and for any fixed fc > 0, i > 1, writing 

Uk,i^ {so,---,Sk,Pi,---,p7) and UkA = {so, ■ ■ ■ , Sk,Pi, ■ ■ ■ ,Pi), 

we a.s. have 

((T", d", Ul,), (r, d, M, Uk.d) ^ 0, (12) 

as n — !■ oo. 

To sum up: by a sequence of applications of Skorohod's theorem we have arrived at a space 
in which, after rescaling, the sequence T" converge almost surely to a Brownian CRT T. We 
have additionally coupled a sequence of random draws from the mass measure of T to its discrete 
counterpart, and a Poisson process on skel(T) x [0,oo) to its discrete counterpart, in such a way 
that any finite collection of such points in the limiting space is arbitrarily closely approximated by a 
corresponding (in both the informal and the technical sense) collection of points in T", for n large 
enough. Furthermore, we have done so in such a manner that for any fixed t > and fc > 1, the 
operation of restricting the Poisson process to the set of points arriving before time t and falling 
within the subtree spanned by the first fc random draws from the mass measure, commutes with 
taking the large-n limit. 
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5.3 The convergence of the discrete backbones 

In this section we continue to assume that T" is a conditioned Galton-Watson tree with critical, finite 
variance offspring distribution ^. Before proving Theorem 5.1, we also need to express the modified 
Aldous-Broder dynamics in the setting of conditioned Galton-Watson trees. The only minor issue 
which needs to be addressed is the fact that the modified Aldous-Broder dynamics should ignore 
points of 7-"" which fall in an already cut subtree. 

First, we consider the planted tree T"(p"), and call the planted vertex w". We extend /i" to 
t;(r"(p")) by setting /i"({it;"}) = 0. Recall the notation a{v) for the parent of vertex v. For each 
< t < oo, let T" be the component of 

(t;(r"(p")), e(T"(p")) \ {(pr, abD) : < r, < t}) 

containing w", and define Tp_ accordingly. (The forest in the preceding equation is the finite-n 
analogue of J^^, but will not be used in what follows.) Write 

/r = {z e N : < < t, ^^"(T,") < Ai"(T,"_)}, 

for the indices corresponding to 'effective' cuts up to time t, and let 

be the set of locations of these cuts. For i G let a:" = pf and let be the parent of x" in T" 
(here we view p" as its own parent). Then, for < t < oo, let 

= (^;(T"),e(r") \ : < r, < t}), 

and for i e write /" for the component of containing x". Note that /" is in fact a component 
of for all <t<oo. 

Write k" — |/^|, and write tt" for the permutation of that reorders the elements of in 
increasing order of the corresponding cut time, so that for i,j g I^^, 7r"(i) < 7r"(j) if and only if 
r" < r". Also, write 

and 

Finally, let T" be the tree obtained from by removing w", then adding the edges 

(^""(i+l)' ^""(i)) 1 < « < k"- 

We view T" as rooted at u". 

Remark. It is a standard fact that if ^ is a mean-one Poisson distribution (in fact, the mean does not 
matter) then T" has the same distribution as the tree obtained from a uniform Cayley tree on [n] by 
removing the vertex labels. In this case. Theorem 3.1 then implies that T" is distributed as a uniform 
Cayley tree with labels removed, and v" is a uniformly random element of v{T"), independent of 
T". This fact will be used in the course of the proof of Theorem 5.1 in Section 5.4. However, it 
plays almost no role in the current section. In particular, all results presented in this section, with the 
exception of Corollary 5.6, are valid for general critical, finite-variance conditioned Galton-Watson 
trees. 

Lemma 5.3. In the space where (8)-( 12) hold, almost surely 

i^^-{T^),t>0)^{^^{Tt),t>0), 
in the sense of uniform convergence on compacts for the Skorohod Ji topology. 
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Proof. Write Vk for the uniform measure on points sq , ■ • • j Sfc . In other words, given T and sq , . . . , Sfc , 
Vk assigns mass l/(fc + 1) to each of the points sq, . . . , s^. Similarly, write for the uniform mea- 
sure on Sq , . . . , s^. By (12), for any fixed i>l, almost surely 

lim dGHp((T",d",z.;\C/^\J,(r,d,i^fe,C/fe..)) = 0. (13) 

n— J-oo 

Also, by Theorem 8 of [8], for almost every realization of T, 

lim dpK-,/i) -0. (14) 

(In fact, in [8], only almost sure weak convergence is claimed, but the proof simply consists of an 
application of the Glivenko-Cantelli theorem and is easily seen to yield convergence with respect to 
dp.) Since for all i > 0, 7t is a compact subspace of T, and the % are decreasing in t, it follows 
that 

{yk{Tt).t>Q)^{ti{Tt),t>Q) (15) 
as fc — > oo. Combining (13) with (10) and (11), we obtain that for each fc > 0, almost surely 

K'(T,"),i>0)^K(7;),i>0) (16) 
as n — > OD. Next, combining (13) and (14) with (12), we obtain that almost surely 

lim lim dGHp((T",d",/i",C/fe.),(r,d,z^fe,C/M)) = 0, 

which together with (13) implies that almost surely 

lim lim sup dp (^", z^^') = 0. 

k ^oo — 

In view of (15) and (16), this proves the lemma. □ 

Next, for each n > 1, reorder the elements of V" as {{pn,i^ tn.i), i > 1} so that < 
for all i > 1. We emphasize that here we consider all atoms of V", not only those that correspond 
to "effective cuts". 

Lemma 5.4. In the space where (8)-(12) hold, a.s. 

V ^ {r-t^,,<t} J 
in the sense of uniform convergence on compacts for the uniform distance, as n oo. 
Proof. From Lemma 5.3 it is immediate that 



^ Ai"(r;)ds,t > 0^ ^ (^^ ^i{Ts)ds, t > 



as n — > oo. Also, ^"(r") = dy/n, so the set {r", i e N} forms a Poisson point process of intensity 
a■^/n on [0, oo), from which it follows straightforwardly that 



t > 



and the result then follows from (7) and the definition of L{t) in (5). □ 
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Our next goal is to show that {L(t), t > 0) is the limit of the discrete process which tracks the 
number of effective cuts up to time ty^. Write 

and note that, for every n > 1, L"(<) increases to k"(T") = #{s > : /^"(T]^) < Af"(TJ!_)}, as 
t — oo. 

Theorem 5.5. In the space in which (8)-( 12) hold, a.s. 

{U\t)/{an^/^),t> 0) ^ {L{t),t> 0) 
in the sense of uniform convergence on compacts for the uniform distance, as n oo. 

In proving Theorem 5.5 we will use a martingale inequality from [33], Theorem 3.15. Let 
{^i}r=o ^ bounded martingale with = 0, adapted to a filtration {^ilf^p. Next let V = 
Y.7=o ^ [^'+1 1^*]' where 

V [x,+,\g,] := E - = E [xl,\g^] - xf 

is the predictable quadratic variation of Xi+i. Define 

w = esssupy, and b= max esssup(Xi+i — X^j^i) , 

0<i<n — 1 

where for a random variable X, the essential supremum ess sup X is defined to equal sup{a; : 
P {X > x) > 0}. Then we have the following bound [33]. For any t > 0, 



P max Xi > t ] < exp 

\0<i<n ' 



2v{l + bt/{3v)) 



(17) 



Proof of Theorem 5.5. In a first part, we prove uniform convergence on compacts for which we do 
not need the trees T", n > 1, to be uniform Cayley trees. Fix 6 > and C > 0. By Lemma 5.4, a.s. 



sup 

0<t<C 



awn 



{j-t^,j<t} 







as 71 — > cx). It follows that 



P ( limsup sup 

n->co Q<t<C 



L-{t)~L{t) 



> S 



(18) 



< P I limsup sup 

n-i-oo 0<t<C 



{j-t^.j<t} 



> 



Also, since has intensity measure ^" ® dt and £"(u(T")) = an^^^, we have that 

lim P(liminf t„ 1^ /jjj > C) = 1 , 

which impUes that the probability in (18) is at most 



lim P I limsup max 



1<J<1 



> 



(19) 



For i > 1, write 
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Also, for each i > 1, let — {pn,i, ■ ■ ■ ,Pn.i}- Taking Qn,i to be the sigma field generated by T" 
and V^, then {Xi,i > 1) is adapted to {Gn.i, « > !)■ Note that 

E [X, I g„^,_i] = A*"(T/^_ J = E [xf I g„,,_i] , 

so in all cases Var [Xi \ Gn.i-i] < 1/4- Also, for alH > 1 we have X)j=i -^j — -^"(^ri,i)- ^^Y (1"^)' 
for any fixed a; > and n > 1 we thus have 



P I max 



j<i j<i 



>y\ <2cxp - 



2y^ 



xy^{l + 4y/(3a;V")). 
Applying this bound with x = 6y/n and summing over n, it follows by Borel-CantelU that 



(20) 



P I limsup max 



l<j<i 



> 



6n'/^ = 0, 



which together with (19) shows that {L'^{t)/{an'^/'^),0 < t < C) ^ iL{t),0 < t < C) almost 
surely for the uniform distance. □ 

Corollary 5.6. If ^ is the Poisson{\) distribution then in the space in which (8)-(12) hold, a.s. 

{U\t)/n^'^,t>{))^{L{t),t>Q) 
in the sense of uniform convergence on [0, oo). 

Proof. If ^ is the Poisson(l) distribution then a = 1. Uniform convergence on compacts follows 
from Theorem 5.5. Furthermore, as noted in the remark just before Lemma 5.3, in this case T" 
is distributed as a uniform Cayley tree on [n] with labels removed. Also, T" is again distributed 
as a uniform Cayley tree with labels removed, and k"(T") is the distance between u" and in 
T", it follows from Theorem 3.1 that ^"(T")/^^/^ converges in distribution to a Rayleigh random 
variable. 

Next, fix e > 0. For any t > 0, given that n''(Tp) < en, the difference k"(T") - is 
dominated by the number of cuts to isolate the root of a uniform Cayley tree on [enj vertices. It 
follows that for any e > 0, 



lim limsupP («:"(T") - > en^^A = 0. 



(21) 



By the principle of accompanying laws (Theorem 9.1.13 of [41]), in the space in which (8)-(12) 
hold we have 

L(oo) = lim L{t) lim "—^^^ 

t^oo n— ^C30 Tl ' 

which together with (21) implies uniform convergence on [0, oo). (This also yields a second proof 
that _L(oo) is almost surely finite, as promised just after (5).) □ 

Before proving Theorem 5. 1 we note one consequence of Corollary 5.6, stated in the introduction 
as Theorem 1.6. A different proof of this result can be found in Abraham and Delmas [1]. 

Corollary 5.7. It holds that /i(7()dt has standard Rayleigh distribution. 

Proof. In proving Corollary 5.6 we showed the existence of a space in which 

L(oo) = lim — - , 

and the latter limit is Rayleigh distributed by Theorem 3.1 The lemma then follows from the defini- 
tion of L(t) in (5) and (7). □ 
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5.4 The proof of Theorem 5.1 

In this section, in order to use the discrete results of Section 3, we assume that ^ is the Poisson(l) 
distribution, or equivalently (see the remark just before Lemma 5.3) that T" is a uniform Cayley 
tree on [n] with its labels removed. In particular, this implies that a — 1. 

Recall the definitions of the trees {fi, i £ Irx,} and {/", i £ from pages 16 and 19 (here we 
simply view each ft as a subset of T). Also, write dn for n^^/^ times the standard graph distance 
on T", and write fi" for the uniform probability measure on w(T"). 

We work in a space where (8)-(12) all hold. For any e > 0, let 

Je = {i e /oo : Moo(/i) > e}- 

The set is necessarily finite (it has size at most e^^), so K'-'^'' :— sup{i : i G J^} is a.s. finite. By 
(11), for all n sufficiently large we in particular have that C and we hereafter assume that 
inclusion indeed holds. 

Let S — {u, v} U IJjg J fi, and let — [J^ yesl^' 2^1 ■ ^ words, is the minimal subtree of 
T which contains each of the subtrees fi, i £ and also contains the distinguished nodes u and v. 
Likewise, let 



{u\v-}U U v{fr 



We let = d\f , and define fi^, d", /i" accordingly. 

The set I^o is countable and Jg f ^oo as e i 0. Also, it follows from the result of Aldous and 
Pitman [8] mentioned earlier that J^iei Moo(/i) = 1 a.s., and we thus a.s. have 

lim^ Moo(/i) = 0. 

Since T is compact and each fi can be viewed as a subtree of T, we must also a.s. have 

lim sup diam (/.;) — 0. 

By these facts and by (12), for any 6 > there is = N{e, 6) which is almost surely finite, such 
that for all n> N, 

((T", d", m", Ul,), (r, d, f,, C/fc,,)) < S, (22) 

and additionally J2i^j Moo(/i) < 5 and supj^j^ diam(/i) < S. We fix a correspondence C G 
"^((T", d", /i", UJ^i), (T, /i, Uk.i)) with dis(C) < 25 and containing the appropriate pairs of points 
from [/^j and Uk.i- It follows from the fact that supj^j^ diam (fi) < S that 

dcHpHTj^fj; {u,v)), {f^,d^,fie, {u,v))) < 6 (23) 

and that 

sup n-i/^diam (/") < 35. (24) 

Next, write mg ~ sup^^-^- fj,{B{x, 5) ), where B{x, 5) is the ball of radius 5 around x in T. We have 
ms i a.s. as (5 — > 0. Choose < 5 < small enough that m^s < e^. Then for n > N{e, 5), and 
for all i £ Jg, by considering the 5 blow-up Cs of the correspondence C, we see that 

4Hp((/r,d^l/^A^LI/r).(/«.^oo|/.,/^oo|/J) < + < (25) 
In particular, for each i e J^, |^Sd(/i") ~ Moo(/i)| < 2e^, so 
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and 



J2 <2e + <5<3e. (27) 



By (24) and (27), it follows that for all n sufficiently large, 

rfcHP ( (T" , d" , A" (T," , rf^ 3 (5 + e) < 6e. 

For each i e 1^0, L{t) = d{u, Xi) and for each i € J^^, = x"). By Corollary 5.6, 

it follows that for all i E J^, for all n sufficiently large, \d{u, Xi) — d"'{u", < 5. together with 
(25) and (26), this implies that 

4Hp((r:\ rf^ A", {u",v^)), {t, I, fi,, {u, v)))< max(,5 + 2e', 2e) < 3e. 

By the two preceding inequalities, (23), and the triangle inequaUty, we obtain that a.s. for all n 
sufficiently large, 

dl^p{{T",d'', fil", (u", v^)), if, I A, v))) <%e + 5< lOe. 

Since e > was arbitrary, the first assertion of the theorem then follows from Theorem 3.1. 

Finally, since the distribution of the collection {yi,i € loo} is determined by its finite dimen- 
sional distributions, the assertion in the statement of Theorem 5.1 about the collection i E /qo} 
then follows from Lemma 5.8, below, whose straightforward proof is omitted. 

Lemma 5.8. Fix n > 1, k > 1, let K = {i E : E T"|S'^]}, and let j E K be the 
element i E K which minimizes r". Suppose that T" is a uniform Cayley tree on [n]. Then for any 
S C w(T"), any tree t with v(t) — S, and any y E S, 



P (t;J. = t andy] = y\ t,(r;^) = s) = \S[ 



\s\ 



6 Conditioned Galton- Watson trees with finite variance 

We now want to prove that the picture that we have obtained for the process in the case of uniform 
Cayley trees is also valid when one considers conditioned Galton-Watson trees with critical, finite 
variance offspring distribution. Fix an offspring distribution C = ('?0j Cij • ■ ■ ) with 

and ^ i(z - l)e, - e (0, oo). 

i>0 'i>0 

Theorem 6.1. Let T" be distributed as a Galton-Watson tree with offspring distribution ^, condi- 
tioned to have n vertices. Then after rescaling, the number of cuts k{T^) required to isolate the root 
ofT^ is asymptotically Rayleigh distributed: 

lim P (K(r") > ax^/n) = e-""^ '"^ . 

n— )-oo 

Under a finite variance assumption, Galton-Watson trees conditioned on their size have the same 
scaling limit as uniform Cayley trees, so when looking at a (n, ^Jn) rescaling for time and space, 
the cutting process will essentially look the same. Completing the argument then boils down to 
showing that once the left-over tree has size o{n) the number of cuts needed to completely destroy 
it is o{s/n). The following lemma shows that this is indeed the case. (Although the factor e^/^ is 
certainly not best possible, it is sufficient for our needs.) 

Lemma 6.2. Suppose that = 1 and Var [^] = cr^ G (0, oo). Let T" be a Galton-Watson tree 
with progeny distribution ^, conditioned on having size n. Let also r"(e) = inf{< : /i"(r") < e}. 
Then, 

lim sup P (n{Tl\,.) > ei/^V^) > 0. 

n-H-oo V ^ ^ / e-J-0 
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Proof. Recall that for a rooted tree T and a node v of T, we write hT{v) for the height of v 
in T, which is the number of edges on the path from the root to v. We also write h{T) = 
max„g„(T) hriv), and call h{T) the height of T. Finally, for i > write Wi(T) = #{w G : 
hriv) = i}. 

For any a;, y > we have 



P Mr;-.(e))> 



(28) 



The first term above is easily bounded using Markov's inequality. We use Janson's representation 
of the number of cuts as records in the tree [23, 24]. Given a tree t, rooted at r, one can assign 
extra labels to the vertices using a random permutation of{l,2,...,|t|}. This random permutation 
determines the order in which the vertices are considered for cutting. In this representation, a vertex 
u will actually produce a cut if and only if the path Jr, uj between u and the root has not been 
previously cut. This happens precisely if u has the minimum label of all vertices on |r, it]. In 
particular, conditional on the height /if (u) of u in t, the probability that a vertex u produces a cut is 
{ht{u) + 1)"^ It follows that 



p > y^, /t(T;„(,)) < x^^ 

1 



< 



< 



< 



< 



< 
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y^/n 
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yV^ 
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E 
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(e)M{/i(T;',.(^))<a;v^} 
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1 + hr, 
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"T rT-'-{/i(T"„, J<a:\/ri} 
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EK(T«)] 
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Ca; 



(29) 



we used the fact that E[wfe(T")] < Cfc uniformly in fc > and n > (see Devroye and Janson 
[17]) to obtain the second-to-last inequality. 

To bound the second term, we relate the finite-n trees T" to their limit T- We work in a space 
in which (8)-(12) all hold, and recall from Section 5.2 the definitions of the collections of points 
{si,i > 1) and {p" : i e N}, and of their finite-n counterparts (s",i > 1) and {p" : i € N}. In 
particular, recall the definitions of the sequences Sk, SJ^, from page 18. 

We now use that for all (5 > 0, 

lim P {dU(.r,d,p),{TlSkld\ris,hP) > S) = 0. 

k^oo 

By (8), we then also have that 



lim limsupP dGH((r",d",p"),(T"Kl,d"|r"is^.j,p") ><5 = 0, 



Equations (10), (11), and (12) provide a coupling of the cuts falling on T'^JS*^] with those falling 
on TJS'fe] so that for any fixed t > and for all sufficiently large n, the cuts falling within T"J5[?] 
and within T[[5fc] occur at essentially the same times and at essentially the same locations. (This is 
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precisely formalized by (10), (11), and (12).) It then follows that in this space, for any e > and 

S>0, 



limsup P 4h((T." d"|T;.„, , , P"), (r.(.), dir.,., , P)) >S)=0. 
Taking 6 — Xy/e, from this we immediately obtain that 

2x 



limsupP ( > —V^j < P iHTr{e)) > xV~e) 

<e-"^', (30) 



for some constant a > 0. The last inequality holds since since: conditional on its mass, 7^(e) is 
a Brownian CRT (see [8], equation (44)); we have < e; the height of a Brownian CRT is 

distributed as the supremum of a Brownian excursion; and the supremum of a Brownian excursion 
has Gaussian tails [25]. 

Then, choosing for instance x = t^l'^ and y = t^/^ and using the bounds in (29) and (30) to 
bound (28) proves the result. □ 

Putting together Corollary 5.7 and the following lemma then yields Theorem 6. 1. 

Lemma 6.3. Let T" be a Galton-Watson tree with offspring distribution ^ conditioned to have size 
n, and let T be a Brownian CRT. //E^ = 1 and Var(^) = (7^6 (0, oo) then 

Proof. Write for the subtree containing the root at time t of the cutting process, and as in Sec- 
tion 5.3 write 

for the number of cuts occurring before time t. Theorem 5.5 implies that for any fixed t £ [0, oo) 



awn 



as 71 — > oo. 

Recall that T"(e) = inf{t : /i"(T") < e}. Since T"(e) < oo almost surely, we have 
On the other hand, 

K{T-)-L-{r-{e)) k{T- ) 

7^ < T-^ > 0, 

^yn \Jn e-i-o 

in probability, uniformly for all n sufficiently large, by Lemma 6.2. Since r"(e) — > oo almost surely 
as e ^ 0, it follows that 

k(T") d 



as n cx), as claimed. □ 
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A Excursions, bridges, trees and forests 

In this section, we describe the transformations of Section 5 in the language of excursions. This 
perspective on the results serves two purposes. First, in the excursion framework, a similarity is 
immediately apparent, between the results of the current paper and results of Aldous and Pitman 
[7] on scaling limits of random mappings and on decompositions of reflecting Brownian bridge. 
Though there seems to be no direct link between the main results of the two papers, the idea that 
they may possess a common strengthening is intriguing. Second, as noted in the body of Section 5, 
a careful reader may have had questions about the precision of the definitions of some of the random 
objects under consideration, and the excursion-theoretic description clarifies such matters. 

Let e = (e(t), < i < 1) be a standard Brownian excursion, and write Te for the M-tree coded 
by e. (We recall that the points of Te are equivalence classes {[x],0 < a; < 1}, where points 
x, y e [0, 1] are equivalent if e{x) — e{y) ~ inf{e(z) : x < z < y}, and refer the reader to [30] for 
more details of this standard construction.) 

Next, let Ae — {{s, y) € [0, 1] x M+ : < y < e(s)} be the set of points lying above the x-axis 
and below the graph of e. For each point {x, y) in A°, the interior of A^, let 

s{x, y) = s{x, y, e) — inf{a;' : x G (0, x), e{z) > y^ z G [x , x]} , and let 
s{x, y) = s(x, y, e) — sup{a;' : x' G {x, 1), e{z) >y'iz<E [x, x']} . 

In other words, the line segment [s{x , y) ,^{x , y)] x {j/} is the maximal horizontal line segment 
through [x, y) contained in Ae- 

We wish to obtain an excursion-theoretic representation of the Poisson process on skel(7^) x 
[0, oo) with intensity measure £® Leb[o_oo)j where £ is the length measure on skel(7^) and Leb[o,oo) 
is Lebesgue measure on [0, oo). To do so, for {x, y) € A°^, we view the points of [s(x, y),s{x, y)) x 
{?/} as representing the point [s{x, y)] of skel(7^). We then consider a process VI which, conditional 
on e, is a Poisson process on A% x [0, oo) with intensity measure at ((x, y),t) given by 

dLeb_4o ® dLeb[o^oo) 
s{x,y,e) - s{x,y,e) ' 
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Fort e [0,oo), let 

Xt = Xtie,r:) ^ {z e [0,1] ■.3iix,y),s) er!,s <t,z e [six, y),six,y)]}. 

In words, the (equivalence classes of) points of Xt are the points of Te lying in subtrees that have 
been cut by r° by time t. We define Xt- accordingly, let Yf = [0, 1] \ Xf and let Yt^ = [0, 1] \ Xt- . 

Next, for < t < oo, let nit — Leb[o^i] (Vt) be the Lebesgue measure of the points that are not 
yet cut at time t, and let nit- = Leb[Q (Yt-). Then let Ve — {p — {{x, y), t) ^ V° : mt < nit-} 
for the set of points that reduce the measure of the "uncut subtree". We next explain how the points 
of Ve yield a family of transformations of the excursion e. 

For z E Yt, the closure of Yj, let Vt{z) — Leb[o ([0, z] n Yj). The function Vt Yt —?' [0, mt] is 
non-decreasing. Furthermore, the results of [8] imply that Vt{l) — rat ™d that for < z < z' < 1 
we have vt{z) — vt{z') if and only if there exists {x, y) G A° such that z = s{x, y) and z' — s{x, y). 
In other words, vt{z) — vt{z') precisely if [z] = [z'] is the root of a subtree that is cut before or at 
time t. 

Let £(:[! — nit, 1] [0, oo) be given by setting ej(z) = e(w(r^(z — (1 — mt))), where 
f("^(u) — mi{x : Vt{x) > u} (we could in fact take Vt^^{u) to be any point in the pre-image of 
u under vt; the comments of the preceding paragraph show that the value of e{v^^{u)) does not 
depend on this choice.) Then Theorem 4 of [8], together with the comments in Section 3.5 of that 
paper, implies that conditional on nit, if the function is translated to have domain [0, nit] then 
the result is distributed as a standard Brownian excursion of length nif We define mt-, Vt- and the 
excursion ej_ similarly. 

Next, for each point p = ((x, y), t) of T-'e, we define a random function with domain [1 — 

mt-, 1 — mt] as follows. For z E [1 — mt-, 1 — mt], set 

eP{z)^et-iv-_\s{x,y)) + z). 

Notice that 

(1 - mt) - (1 - mt-) = mt- - mt = Vt-{s{x,y)) - Vt-{s{x,y)). 

Translated to have range [0,vt-{s{x,y)) ~ Vt-{s{x,y))], the excursion then codes the tree cut 
by point p under the standard coding of trees by excursions. 

Finally, for t E [0, oo) let et : [0, 1] — > [0, oo) be the unique function such that et[[i-mt,i] = 
and such that for each p = {x, y, s) E Ve with < s < t, 

et\[i-mt-,i-mt] = ■ 
The function et is the "concatenation" of the functions 

{eP,p=ix,y,s)EVe : < s < t} 

and of the function ej. We define the function et- similarly. The function et is comprised of 
a countably infinite number of excursions away from zero; the trees coded by these excursions 
together comprise the M-forest {Tt, dt,IJ-t) of Section 5. A similar coding of a random continuum 
forest, by a reflecting Brownian bridge conditioned on its local time at zero, is described in [8], 
Section 3.5. 

The random variables {et,t > 0) are consistent in the sense that for any fixed s E [0, 1), there 
is an almost surely finite time to such that for all t' > t > to, et'|[o,s] = et|[o,s]- It follows that the 
limit = limt_>.oo et is almost surely well-defined. 

In the current terminology, for < t < oo, have 

L{t) = / mgds. 
Jo 

We view et[io.i-mt] = |[o,i-?rit] as coding a random measured M-tree with mass 1 — mt, as 
follows. Let df : [0,1 — mt] — [0, oo) be given by setting, for 0<u<w<l — mt, 

dt{u,v) = et(v) + et{u) — 2 inf et{s) + L{v) — L{u). 

u<s<v 
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Then the tree (7t, dt,fJ,t) of Section 5 may be defined as follows. Set Tt — {[u], < u < 1 — rrit}, 
where [u] denotes the equivalence class of u: [u] — {0 < v < 1 — nit : df{u, v) — 0}. Let dt be the 
push-forward of d^ to Tt, and let jit be the push-forward of Lebesgue measure on [0, 1 — rrif] to 7*. 

The content of the first assertion of Theorem 5 . 1 is that Coo is distributed as a reflecting Brownian 
bridge; we may see the equivalence between the first part of Theorem 5.1 and the latter statement 
as follows. First, a standard and trivial extension of Theorem 5.2, states that a uniformly random 
doubly-marked tree on [n] converges to (T, d, /i, {p, p')) with respect to d^^^^, where (T, d, p.) is a 
Brownian CRT and p, p' are independent elements of T, each with law p. Next, recall the stan- 
dard one-to-one map between doubly-marked trees on [n] and ordered rooted forests on [n] which 
"removes the edges on the path between the two marked vertices". Finally, results from [7] - in 
particular, the first two distributional convergence results in Theorem 8 of that paper, together with 
the remark in Section 10 - imply that that the contour process of a uniformly random ordered rooted 
forest on [n] converges after appropriate rescaling to a reflecting Brownian bridge. (We remark that 
a direct encoding of a doubly-rooted Brownian CRT by reflecting Brownian bridge, also mentioned 
in the introduction, is given in [12]. The latter is closely related to, but distinct from, the encoding 
obtained by considering ordered rooted forests as above.) 

Next, for each point p = {{x, y),t) e Ve, let 

Up = 1 - TOt + Vtis{x,y,e)) € [1 - mj, 1]. 

If we view as coding a tree, then the (equivalence class of the) point Up is a leaf of this tree. Then 
let Up = yp{e, Ve) be the push-forward of Up under the map that sends e^. In other words, 

let p' = {{x' , y'), t') be the a.s. unique point of Ve with t' > t, with s{x' , y' , e) < s{x, y, e), with 
s{x' , y' , e) > s{x, y, e), and minimizing t' subject to these constraints. Then we set 

yp{e,Ve) = 1 - mf - + vt'-{s{x,y,e)) - Wf- (s(x', y', e)) 

The second assertion of Theorem 5.1 is that conditional on Coo, the law of {yp,p E "Pe} is the same 
as that of the following family of random variables. Let Z = {z G (0, 1) : eoo{z) = 0}. Then 
independently for each z E Z let be uniform on [z, 1]. 

We remark that a related family of random variables plays a role in Theorem 8 of [7] (in partic- 
ular in the third distributional convergence of that theorem). The latter theorem, which describes a 
distributional limit for uniformly random mappings of [n], has several suggestive similarities to our 
main result. We do not see any any direct relation between the distributional limits described in that 
paper and those established here. Establishing such a relation would certainly be of interest, and 
would likely yield insights in both the discrete and limiting settings. 
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